61 research outputs found

    GRU-D-Weibull: A Novel Real-Time Individualized Endpoint Prediction

    Full text link
    Accurate prediction models for individual-level endpoints and time-to-endpoints are crucial in clinical practice. In this study, we propose a novel approach, GRU-D-Weibull, which combines gated recurrent units with decay (GRU-D) to model the Weibull distribution. Our method enables real-time individualized endpoint prediction and population-level risk management. Using a cohort of 6,879 patients with stage 4 chronic kidney disease (CKD4), we evaluated the performance of GRU-D-Weibull in endpoint prediction. The C-index of GRU-D-Weibull was ~0.7 at the index date and increased to ~0.77 after 4.3 years of follow-up, similar to random survival forest. Our approach achieved an absolute L1-loss of ~1.1 years (SD 0.95) at the CKD4 index date and a minimum of ~0.45 years (SD0.3) at 4 years of follow-up, outperforming competing methods significantly. GRU-D-Weibull consistently constrained the predicted survival probability at the time of an event within a smaller and more fixed range compared to other models throughout the follow-up period. We observed significant correlations between the error in point estimates and missing proportions of input features at the index date (correlations from ~0.1 to ~0.3), which diminished within 1 year as more data became available. By post-training recalibration, we successfully aligned the predicted and observed survival probabilities across multiple prediction horizons at different time points during follow-up. Our findings demonstrate the considerable potential of GRU-D-Weibull as the next-generation architecture for endpoint risk management, capable of generating various endpoint estimates for real-time monitoring using clinical data.Comment: 30 pages, 7 figures, 4 supplementary figure

    A Novel Energy-Efficient Approach for Human Activity Recognition

    Get PDF
    In this paper, we propose a novel energy-efficient approach for mobile activity recognition system (ARS) to detect human activities. The proposed energy-efficient ARS, using low sampling rates, can achieve high recognition accuracy and low energy consumption. A novel classifier that integrates hierarchical support vector machine and context-based classification (HSVMCC) is presented to achieve a high accuracy of activity recognition when the sampling rate is less than the activity frequency, i.e., the Nyquist sampling theorem is not satisfied. We tested the proposed energy-efficient approach with the data collected from 20 volunteers (14 males and six females) and the average recognition accuracy of around 96.0% was achieved. Results show that using a low sampling rate of 1Hz can save 17.3% and 59.6% of energy compared with the sampling rates of 5 Hz and 50 Hz. The proposed low sampling rate approach can greatly reduce the power consumption while maintaining high activity recognition accuracy. The composition of power consumption in online ARS is also investigated in this paper

    Colorectal Cancer with Residual Polyp of Origin: A Model of Malignant Transformation

    Get PDF
    AbstractThe majority of colorectal cancers (CRCs) arise from adenomatous polyps. In this study, we sought to present the underrecognized CRC with the residual polyp of origin (CRC RPO+) as an entity to be utilized as a model to study colorectal carcinogenesis. We identified all subjects with biopsy-proven CRC RPO+ that were evaluated over 10 years at Mayo Clinic, Rochester, MN, and compared their clinical and pathologic characteristics to CRC without remnant polyps (CRC RPO−). Overall survival and disease-free survival overlap with an equivalent hazard ratio between CRC RPO+ and RPO− cases when age, stage, and grade are adjusted. The somatic genomic profile obtained by whole genome sequencing and the gene expression profiles by RNA-seq for CRC RPO+ tumors were compared with that of age -and gender-matched CRC RPO− evaluated by The Cancer Genome Atlas. CRC RPO+ cases were more commonly found with lower-grade, earlier-stage disease than CRC RPO−. However, within the same disease stage and grade, their clinical course is very similar to that of CRC RPO−. The mutation frequencies of commonly mutated genes in CRC are similar between CRC RPO+ and RPO− cases. Likewise, gene expression patterns are indistinguishable between the RPO+ and RPO− cases. We have confirmed that CRC RPO+ is clinically and biologically similar to CRC RPO− and may be utilized as a model of the adenoma to carcinoma transition

    Mass Homozygotes Accumulation in the NCI-60 Cancer Cell Lines As Compared to HapMap Trios, and Relation to Fragile Site Location

    Get PDF
    Runs of homozygosity (ROH) represents extended length of homozygotes on a long genomic distance. In oncology, it is known as loss of heterozygosity (LOH) if identified exclusively in cancer cell rather than in matched control cell. Studies have identified several genomic regions which show consistent ROH in different kinds of carcinoma. To query whether this consistency can be observed on broader spectrum, both in more cancer types and in wider genomic regions, we investigated ROH patterns in the National Cancer Institute 60 cancer cell line panel (NCI-60) and HapMap Caucasian healthy trio families. Using results from Affymetrix 500 K SNP arrays, we report a genome wide significant association of ROH regions between the NCI-60 and HapMap samples, with much a higher level of ROH (11 fold) in the cancer cell lines. Analysis shows that more severe ROH found in cancer cells appears to be the extension of existing ROH in healthy state. In the HapMap trios, the adult subgroup had a slightly but significantly higher level (1.02 fold) of ROH than did the young subgroup. For several ROH regions we observed the co-occurrence of fragile sites (FRAs). However, FRA on the genome wide level does not show a clear relationship with ROH regions

    Comparison of Three Information Sources for Smoking Information in Electronic Health Records

    No full text
    Objective The primary aim was to compare independent and joint performance of retrieving smoking status through different sources, including narrative text processed by natural language processing (NLP), patient-provided information (PPI), and diagnosis codes (ie, International Classification of Diseases, Ninth Revision [ICD-9]). We also compared the performance of retrieving smoking strength information (ie, heavy/light smoker) from narrative text and PPL Materials and Methods Our study leveraged an existing lung cancer cohort for smoking status, amount, and strength information, which was manually chart-reviewed. On the NLP side, smoking-related electronic medical record (EMR) data were retrieved first. A pattern-based smoking information extraction module was then implemented to extract smoking-related information. After that, heuristic rules were used to obtain smoking status-related information. Smoking information was also obtained from structured data sources based on diagnosis codes and PPI. Sensitivity, specificity, and accuracy were measured using patients with coverage (ie, the proportion of patients whose smoking status/strength can be effectively determined). Results NLP alone has the best overall performance for smoking status extraction (patient coverage: 0.88; sensitivity: 0.97; specificity: 0.70; accuracy: 0.88); combining PPI with NLP further improved patient coverage to 0.96. ICD-9 does not provide additional improvement to NLP and its combination with PPI. For smoking strength, combining NLP with PPI has slight improvement over NLP alone. Conclusion These findings suggest that narrative text could serve as a more reliable and comprehensive source for obtaining smoking-related information than structured data sources. PPI, the readily available structured data, could be used as a complementary source for more comprehensive patient coverage

    A New Image Denoising Method by Combining WT with ICA

    No full text
    In order to improve the image denoising ability, the wavelet transform (WT) and independent component analysis (ICA) are both introduced into image denoising in this paper. Although these two algorithms have their own advantages in image denoising, they are unable to reduce noises completely, which makes it difficult to achieve ideal effect. Therefore, a new image denoising method is proposed based on the combination of WT with ICA (WT-ICA). For verifying the WT-ICA denoising method, we adopt four image denoising methods for comparison: median filtering (MF), wavelet soft thresholding (WST), ICA, and WT-ICA. From the experimental results, it is shown that WT-ICA can significantly reduce noises and get lower-noise image. Moreover, the average of WT-ICA denoising image’s peak signal to noise ratio (PSNR) is improved by 20.54% compared with noisy image and 11.68% compared with the classical WST denoising image, which demonstrates its advantage. From the performance of texture and edge detection, denoising image by WT-ICA is closer to the original image. Therefore, the new method has its unique advantage in image denoising, which lays a solid foundation for the realization of further image processing task

    A critical review on the bio-removal of hazardous heavy metals from contaminated soils: Issues, progress, eco-environmental concerns and opportunities

    No full text
    Mechanism of four methods for removing hazardous heavy metal are detailed and compared-chemical/physical remediation, animal remediation, phytoremediation and microremediation with emphasis on bio-removal aspects. The latter two, namely the use of plants and microbes, are preferred because of their cost-effectiveness, environmental friendliness and fewer side effects. Also the obvious disadvantages of other alternatives are listed. In the future the application of genetic engineering or cell engineering to create an expected and ideal species would become popular and necessary. However, a concomitant and latent danger of genetic pollution is realized by a few persons. To cope with this potential harm, several suggestions are put forward including choosing self-pollinated plants, creating infertile polyploid species and carefully selecting easy-controlled microbe species. Bravely, the authors point out that current investigation of noncrop hyperaccumulators is of little significance in application. Pragmatic development in the future should be crop hyperaccumulators (newly termed as "cropaccumulators") by transgenic or symbiotic approach. Considering no effective plan has been put forward by others about concrete steps of applying a hyperaccumulator to practice, the authors bring forward a set of universal procedures, which is novel, tentative and adaptive to evaluate hyperaccumulators' feasibility before large-scale commercialization. (C) 2009 Elsevier B.V. All rights reserved
    • …
    corecore